A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence
نویسندگان
چکیده
MOTIVATION Predicting protein interactions involving peptide recognition domains is essential for understanding the many important biological processes they mediate. It is important to consider the binding strength of these interactions to help us construct more biologically relevant protein interaction networks that consider cellular context and competition between potential binders. RESULTS We developed a novel regression framework that considers both positive (quantitative) and negative (qualitative) interaction data available for mouse PDZ domains to quantitatively predict interactions between PDZ domains, a large peptide recognition domain family, and their peptide ligands using primary sequence information. First, we show that it is possible to learn from existing quantitative and negative interaction data to infer the relative binding strength of interactions involving previously unseen PDZ domains and/or peptides given their primary sequence. Performance was measured using cross-validated hold out testing and testing with previously unseen PDZ domain-peptide interactions. Second, we find that incorporating negative data improves quantitative interaction prediction. Third, we show that sequence similarity is an important prediction performance determinant, which suggests that experimentally collecting additional quantitative interaction data for underrepresented PDZ domain subfamilies will improve prediction. AVAILABILITY AND IMPLEMENTATION The Matlab code for our SemiSVR predictor and all data used here are available at http://baderlab.org/Data/PDZAffinity.
منابع مشابه
Inferring PDZ Domain Multi-Mutant Binding Preferences from Single-Mutant Data
Many important cellular protein interactions are mediated by peptide recognition domains. The ability to predict a domain's binding specificity directly from its primary sequence is essential to understanding the complexity of protein-protein interaction networks. One such recognition domain is the PDZ domain, functioning in scaffold proteins that facilitate formation of signaling networks. Pre...
متن کاملUncovering quantitative protein interaction networks for mouse PDZ domains using protein microarrays.
One of the principal challenges in systems biology is to uncover the networks of protein-protein interactions that underlie most biological processes. To date, experimental efforts directed at this problem have largely produced only qualitative networks that are replete with false positives and false negatives. Here, we describe a domain-centered approach--compatible with genome-wide investigat...
متن کاملA physical model for PDZ-domain/peptide interactions
The PDZ domain is an interaction motif that recognizes and binds the C-terminal peptides of target proteins. PDZ domains are ubiquitous in nature and help assemble multiprotein complexes that control cellular organization and signaling cascades. We present an optimized energy function to predict the binding free energy (ΔΔG) of PDZ domain/peptide interactions computationally. Geometry-optimized...
متن کاملDomPep—A General Method for Predicting Modular Domain-Mediated Protein-Protein Interactions
Protein-protein interactions (PPIs) are frequently mediated by the binding of a modular domain in one protein to a short, linear peptide motif in its partner. The advent of proteomic methods such as peptide and protein arrays has led to the accumulation of a wealth of interaction data for modular interaction domains. Although several computational programs have been developed to predict modular...
متن کاملEnergetic determinants of internal motif recognition by PDZ domains.
PDZ domains are protein-protein interaction modules that organize intracellular signaling complexes. Most PDZ domains recognize specific peptide motifs followed by a required COOH-terminus. However, several PDZ domains have been found which recognize specific internal peptide motifs. The best characterized example is the syntrophin PDZ domain which, in addition to binding peptide ligands with t...
متن کامل